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Abstract — Reed-Solomon (RS) codes over GF(2 m ) have tradi- 
tionally been the most popular non-binary codes in almost all 
practical applications. The distance properties of RS codes result 
in excellent performance under hard-decision bounded-distance 
decoding. However, efficient and implementable soft decoding for 
high-rate (about 0.9) RS codes over large fields (GF(256), say) 
continues to remain a subject of research with a promise of 
further coding gains. In this work, our objective is to propose 
and investigate 2 m -ary codes with non-trivial binary trace codes 
as an alternative to RS codes. We derive bounds on the rate of 
a 2 m -ary code with a non- trivial binary trace code. Then we 
construct certain subcodes of RS codes over GF(2 m ) that have 
a non-trivial binary trace with distances and rates meeting the 
derived bounds. The properties of these subcodes are studied 
and low-complexity hard-decision and soft-decision decoders are 
proposed. The decoders are analyzed, and their performance is 
compared with that of comparable RS codes. Our results suggest 
that these subcodes of RS codes could be viable alternatives for 
RS codes in applications. 

Index Terms — Reed-Solomon codes, soft-decision decoding, 
trace codes, bounds on codes. 

I. Introduction 

Reed-Solomon (RS) codes Q] are the most prevalent and 
commonly used codes today with applications ranging from 
satellite communications to computer drives. RS codes are 
popular, in theory, for their elegant algebraic construction. 
In practice, RS codes can be encoded and decoded with 
manageable complexity and high speed. RS codes continue 
to remain objects of active research with most recent interest 
being in list and soft-decision decoding (2) (5) flD . 

Efficient soft decoding of RS codes has traditionally been 
a problem of importance. Early methods for soft decoding 
of RS codes included Chase decoding and Generalized Min- 
imum Distance (GMD) decoding (5]. Other methods for soft 
decoding RS codes include J6)||7). Recently, the Koetter- 
Vardy algorithm Q, the belief -propagation-based iterative 
algorithm [8] and bit-level GMD algorithm |4) have been 
proposed. Common themes in the above methods include (1) 
an additional coding gain of less than 1 dB, (2) an increase 
in complexity with size of the field, and (3) an increase in 
complexity for higher coding gain. As a result, efficient soft 
decoders are not readily available for high rate (rate 0.9 and 
above) RS codes over large fields (GF(256) and larger) in order 
to achieve the « 2 — 3 dB of possible coding gain. The best 
coding gain achieved for the (255,239) RS code over GF(256) 
appears to be about 0.7 dB (at a block error rate of 10 -3 ) 
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over the additive white Gaussian noise (AWGN) channel. This 
is obtained with the bit-level GMD algorithm [4], which is 
a version of the GMD algorithm with bit-level erasures and 
Guruswami-Sudan list decoding. 

In this work, we provide an approach for improving cod- 
ing gain in the high-rate large-field case by exploiting the 
properties of images and traces of codes. Codes over GF(2 m ) 
are typically expanded into a binary image (using a basis for 
GF(2 m ) over GF(2)) before actual use in a physical channel. 
Hence, the binary image of codes over GF(2 m ) deserve to 
be studied closely. The binary trace code is closely associated 
with the image, since every image of a codeword over GF(2 m ) 
can be shown to be the concatenation of m codewords from 
the trace code. However, the algebraic structure and distance 
properties of the image (and trace, to an extent) have proved 
to be difficult to characterize over the years. For instance, 
determining the basis that results in an image of highest 
minimum distance (9) continues to be an open problem. 
Moreover, the exact practical utility (in terms of dBs of coding 
gain) of studying the properties of the trace and image have not 
yet been established concretely. One of the basic contributions 
of this work is to establish a possible utility, in terms of coding 
gain, for studying the trace and image. 

Specifically, in this work, we study codes over GF(2 m ) 
whose traces over GF(2) are non-trivial codes (not the identity 
code, for instance) with a minimum distance greater than 1. 
We characterize the structure of codes with a non-trivial trace 
and demonstrate properties that could be useful in practice. 
We derive some bounds on the minimum distance of the code 
and its trace using ideas of generalized Hamming weights 
flOl . These bounds allow us to study the constraints on the 
minimum distance of the original code imposed by a non- 
trivial trace code. 

On the practical side, we provide Reed-Solomon-like con- 
structions for codes with a non-trivial trace. Basically, these 
are subcodes of RS codes whose traces are binary BCH codes. 
Suitable non-consecutive zeros are added to the set of zeros 
of a parent RS code to enable the trace to be a BCH code. 
We show that these codes, which we call Sub-Reed-Solomon 
(SRS) codes, meet the minimum distance bounds derived for 
codes with a non-trivial trace. Hence, SRS codes have best 
possible distance properties. In addition, our analysis (using 
list decoders) shows that a large fraction of errors beyond half 
the minimum distance are correctable. Hence, the performance 
of SRS codes is comparable to that of a traditional RS code 
at the same rate. 

The main utility of SRS codes is that they are more 
amenable to efficient soft-decision decoding because of the 



2 



trace structure. Since the image of a 2 m -ary code is a con- 
catenation of its binary trace, a soft decoder for the trace 
can be efficiently used to process soft input for the image. 
Using this idea, we propose simple soft decoders for SRS 
codes. Our simulations show that the proposed soft decoders 
for high-rate (> 0.9) SRS codes over large fields (GF(256)) 
perform 0.4-0.5 dB better than other soft decoders of tra- 
ditional RS codes at the same rate. A coding gain of 0.7- 
0.8 dB is possible over traditional bounded-distance decoders 
with low-complexity soft decoders, which involve efficient soft 
processing followed by traditional bounded-distance decoding. 
The complexity of obtaining 0.7 dB of coding gain with a 
(255,239) SRS code over GF(256) is lesser than that of the 
bit-level GMD algorithm running on the (255,239) RS code. 
Hence, our results suggest that SRS codes could be competent 
alternatives to RS codes in certain situations. 

The rest of this article is organized as follows. Section [n] 
introduces the required notation and definitions for codes with 
a non-trivial trace. The basic structure of codes and their traces 
is shown in Section[ni] and the bounds on minimum distance is 
discussed in Section HVl SectionlVlintroduces the construction 
of SRS codes and derives interesting properties of SRS codes. 
Hard-decision list decoders for SRS codes and their error- 
correcting properties are studied in Section [VT] Section IVIII 
discusses three different soft-input decoders for SRS codes. 
Finally, concluding remarks are made in Section [Villi 

II. Notations and Basic Definitions 

See J9) for more details on the definitions and preliminary 
results in this section. A finite field GF(g m ) (q: power of 
prime) is an m-dimensional vector space over GF(q), Trace of 
an element a G GF(q m ) is a linear mapping Tr : GF(q m ) — > 
GF(g) defined by Tr(a) = J2i^o a<1 ■ The trace of a vector 
[a i a,2 ■ ■ ■ ) is [Tr(ai) Tr(a2) ■ • • ]. If C is a code over GF(q m ), 
the trace of C, denoted Tr(C), consists of the traces of all 
codewords of C. In general, Tr(C) is a (n, > k,< d) code over 
GF(q). The subfield subcode of C, denoted SS(C), is defined 
as C f](GF(q)) n . SS(C) contains the codewords of the g m -ary 
code C that are actually over GF(q). By Delsarte's theorem, 
we have 

(SS(C)) X = Tr(C x ). (1) 

A set of m elements of GF(q m ) linearly independent 
over GF(g) form a basis for this vector space. Let B = 
{Pi P2 ■■■Pm} be a basis for GF(q m ) over GF(q). Let 
B' = {P\ P' 2 ■ ■■P' m } be the dual basis of B such 



that Tr(ai/?j) 



Each element a £ GF(q m ) can be 



expanded as a = Y^iLi a iPi^ where a, = Tr(a/^). The 
image of a e GF{q m ) with respect to B is the vector 
Ime(a) = [oi a 2 • • ■ a m ] over GF(q). The image of C with 
respect to B, denoted Img(C), consists of the images (with 
respect to B) of all codewords of C. Image of an (n,k,d) 
linear code over GF(q m ) will be an (nm, km,> d) linear 
code over GF(q). 

For most cases in this paper, we restrict ourselves to GF(2 m ) 
for ease of description and practicality. Almost all our results 
have straight-forward extensions to GF(g m ). Also, in the 
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Fig. 1 . Structure of parity-check matrix for the image. 



context of this paper, an (n, k) code is said to be nontrivial if 

1 < k < n- 1. 

III. Structure 

Let C be a linear code of length n over GF(<7 m ), and let 
B={PiPi ■■■ p m } be a basis for GF(<f") over GF(g). Let 
B' = {P\ P' 2 ■■■ P' m } be the dual basis of B. 

Consider a codeword c = [c\ c 2 ■ ■ • c„] G C. The image of c 
is the vector [Img(ci) Img^) • • • Ime(c„)], where Img(ci) = 
[Tr(/3Jci) Tr(/3 2 Ci) • • • Tr(/3 r ' n Ci)]. For convenience, we view 
the image asanxm matrix with the i-th row being Img(cj). 

Proposition 1: Each column of an image matrix in Ime(C) 
is a codeword of Tr(C). 

Proof: The j-th column of the image matrix will be 

[Tr( Cl /3;)Tr(c 2 /3;)---Tr(c n /3;)]. 

ceC => p'jceC. 

Hence the j-th column will belong to the trace of C. ■ 
The above property establishes the importance and util- 
ity of a non-trivial trace of a g m -ary code. Basically, 
the image is a concatenation of codewords from the trace 
code with certain restrictions imposed by the overall code. 
As suggested by the concatenation, we let Ime(c) = 
[Tr(/3Jc) Tr(/3 2 c) • ■ • Tr(/3^c)], which is a permuted version 
of the image of c. The image of C is then defined as 
Img(C) = {Im B (c) :ceC}. 

The trace code imposes a structure on the party-check 
matrix of a q m -ary code with a non-trivial trace. 

Proposition 2: Let C be an (n, k) code over GF{q m ). Let 
Tr(C) be an (n, k') code over GF(q) with a n — k' x n parity- 
check matrix H' . Then there exists a n — k X n parity-check 
matrix H for C of the form 



H 



H' 
H" 



Proof: Since the rows of H' belong to Tr(C)^, by Del- 
sarte's theorem ([T}, the rows of H' belong to SS^C- 1 ) C C^. 
Since H' is a full-rank matrix over GF(g) (and hence over 
GF(q m )), the result follows. ■ 
The matrix H" will, in general, have entries from GF(g m ). 
Starting from the parity-check matrix of Proposition [2] we can 
obtain a parity-check matrix for Ime(C) with the form shown 
in Fig. [TJ In the matrix of Fig. Q] 
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H'/ = 



' Im B (A-K) 
Im e (/W 2 ') 

Im B (Ah£,_ fc 



1 < i < m, 



where h", 1 < j < k' — k denotes the j'-th row of H" . It is 
clear that a nontrivial trace code imposes a useful structure on 
the parity-check matrix of the image. In this work, we exploit 
this structure for efficient soft decoding. 



IV. Minimum Distance Bounds 

We begin with a well-known basic result on the minimum 
distances of a code, its image and subfield subcode. 

Proposition 3: If d, d ss and d{ are the minimum distances 
of C, SS(C) and Img(C), respectively, we have d < di < d ss . 

Proof: Clearly, di > d. Suppose c = [ci C2 • • • c n ] G 
SS(C) C C is a minimum weight codeword of SS(C). Since 
Ci G GF(<7), image of a/3i is 

[cMPiP'i) CiTr(A^) • • • CiTrifap'J] = [c,0- • -0]. 

Hence, weight of the image of (3iC G C is equal to the weight 
of c, and the result follows. ■ 



A. Generalized Hamming weight bound 

The standard Singleton bound states that d < n — k + 1 
for a (n, k, d) code C over GF(q m ). If we further require that 
the trace Tr(C) is a (n, k', d') code, with k! > k, d' < d, 
the additional structure in the parity-check matrix results in a 
stronger bound on d. 

The notion of generalized Hamming weights (GHWs), in- 
troduced in IflOl , is used in the bound. Let D be a subcode of 
a length-n binary code C. The support of D, denoted x{D), 
is defined as 

X (D) = {i:l<i<n, 3[ci c 2 • • • c„] G D : a + 0}. 

The set x{D) is the set of positions where not all codewords in 
D are zero. The r-th Hamming weight of C, denoted d r {C), 
is defined as 

d r (C) = mm{\x(D)\ : D is a (n, r) subcode of C}. 

In words, the r-th Hamming weight of C is the minimum 
support of a r-dimensional subcode of C. 

Proposition 4: Let C be a (n, k, d) code over GF(g m ) with 
Tr(C) being a (n,k') code (k' > k). Then, 



d < d k ,_ 



fc'-fc+l 



(Tr(C)) 



Proof: Let 



H = 



H' 
H" 



be a parity-check matrix for C as per Proposition [2] Let D 
be a (k' — k + 1) -dimensional subcode of Tr(C) with support 
X (D) such that |xCD)| = 4'-fc+i(Tr(C)). Let 



H D = 



H' D 
HZ. 



be the submatrix of H formed by the columns indexed by 
x{D)- The matrix H' D , which is a parity-check matrix for D, 
has rank rrj = \x(D) \ — (k' — k + 1). By row operations H' D 
can be reduced to the form 



rn 





p' 

D 





where I rD is the ro x ro identity matrix, Po is a rrj x (k 1 — 
k + 1) matrix, and represents all-zero matrices of suitable 
size. Therefore, by row operations, Ho can be reduced to the 
form 



ro 





p' 


p" 



1) matrix with entries 
1) -length vector v over 



where P£ is a (k' - k) x (k' - k 
from GF(<7 m ). Consider a (k' — k - 
GF(q rn ) such that -P^v T = 0. From the form of Ho above, 
it is clear that there exists a length-rrj vector u such that 
Ho[u v] T = 0. Hence, the vector with [u v] in the positions 
x{D) and zeroes for the remaining positions is a codeword of 
C with weight less than or equal to \x(D)\ = dfe/_fc+i(Tr(C)). 

■ 

For a (n, k) code C with a parity-check matrix H, another 
quantity closely related to generalized Hamming weights is 
the following, which is called equivocation with s erasures 
(0 < s < n) following (To): 

AJC) = min rank(iJ/), (2) 

where Hj denotes the submatrix of H formed by the columns 
indexed by /. A careful reworking of Corollary A (Appendix) 
in ED shows that A S (C) = A, < A < n - k for which 

n - dn-k-A+iiC^) < s <n — d n - k -&{C^) (3) 

holds. Hence, the equivocations of a code C can be computed 
using the generalized Hamming weights of the dual code C . 

Proposition 5: Suppose that C is a (n, k, d) code over 
GF(q m ) with Tr(C) being a (n,k',d') code over GF(q). Let 



H 



H' 



H' 



be a parity-check matrix for C such that H' is a 
parity-check matrix for Tr(C). Then, 

d > d" + A d (Tr(C)), 

where d" is the minimum distance of the (n,n — (k' — k)) 
code over GF{q m ) with parity-check matrix H" . 

Proof: Suppose c G C is a weight-d codeword with 
nonzero positions / C {1, 2, • • • , n], \I\ = d. Let Hi = ^h- 
be the n — k x d submatrix of H with columns indexed by /. 

By definition of A d (Tr(C)), we have that rank(i/j) > 
Ad(Tr(C)). By arguments similar to the proof of Proposition 
[4] we see that row operations will result in Arf(Tr(C)) columns 
of H" becoming zero, and d" < d - A d (Tr(C)). ■ 

Since H" has entries from GF(q m ), we could meet the 
Singleton bound and have d" = k' — k + 1 for several range of 
parameters. Assuming that the Singleton bound is met for the 
code with parity-check matrix H" , combining Propositions @] 
and [3] we get 



k' 



1 + A d (Tr(C)) < d < d fe -_ fe+ i(Tr(C)). 
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Hence, the generalized Hamming weights of Tr(C) and Tr(C) 1 " 
play a significant role in upper and lower bounding the 
minimum distance of a code with a non-trivial trace. 

B. Sphere packing bound 

For the sphere packing bound, we restrict ourselves to the 
binary case and set q = 2 for simplicity. As before, the image 
of a vector v = \v\ V2 • ■ -v n ] G GF(2 m )" is represented as 
a n x m binary matrix Irng(v) whose i-th row is Img(u ? ) = 

[Vn V t2 ■ ■ ■ V im ] 

denoted va. 



G GF(2). The j-th column of Img(v) is 



Let C be a (n, k, d) code over GF(2 m ) with t = [ 



^J.Let 

d! be the minimum distance of Tr(C) with t' = [^-^-\. The 
sphere around a codeword c G C is the following: 

S*(c) = {v : d H (v,c) < *} (J {v : c,-) < i'}. (4) 

l<j<m 

As in standard sphere packing bounds, the sphere includes 
vectors in GF(2™)" that are within a Hamming distance of 
t from the codeword c. In addition, vectors whose images 
contain columns that are within a Hamming distance of t' from 
the corresponding column of the image of c are included in 
the sphere. 

Let Si = {v : d#(v, c) < t} and S 2 = {v : dH(vj 7 Cj) < 
< j < m)}. We see that \Si\ = £L (™)(2 m - 1)' and 

( / \ m 

ELo(?)J • For t > mf, |Sin Sal = 1521. 
For t < mi', some additional combinatorics is involved in 
the computation of Si flS^I- Let v*, 1 < i < m, be uniformly 
and independently chosen n-bit vectors of weight at most 
tf. Let the random variable Xj = wt(vi OR V2 OR • • • v,), 
where wt denotes Hamming weight and OR denotes bitwise 
binary OR. The probability mass function (PMF) of Xi can be 
iteratively found starting with Xi and ending in X m . Finally, 

\s 1 ns 2 \ = \s 2 \Pv{x rn <t}. 

In either case, we let \Si (~1 S%\ = A nm (i, i')|/f>2|, where 
X nm (t,t') = 1 for t > mf and \ nm (t,t') = Pr{X m < t} 
otherwise. Hence, |5(c)| = (1 — A„ m (t,t'))|S , 2| + |5i|, and 
the sphere-packing bound becomes 



2"- fe > (1-A nm (*,*')) 



£(")) + S(^-* 



(5) 

Asymptotically, (EjLo (?))"' tends to 2 nmff2 (*'/«) and 

E*=o (™)( 2 ™ - !)' tends to 2™ mff2m 0/") ; where ff 9 (x) = 
xlog g (g — 1) — xlog ? x — (1 — x) log ? (l — x) is the entropy 
function. If H 2 (t' jn) > H^mit/ri), the first term dominates 
the bound, else the second term is dominant. Note that 
^nm(t,t') — 1 when t > mt' and the first term becomes 
zero. 

C. Existential bounds 

Bounds analogous to the Gilbert- Varshamov (GV) bound 
can be obtained for codes with a non-trivial trace. The tra- 
ditional GV bound states that a (n, k, d) code over GF(2 m ) 



exists whenever 



(2 m ) 



i—k 



> 



d-2 

E 



n- 1 



(2 m - l) 1 



The RHS above is an upper bound on the number of (n — k)- 
tuples over GF(2 m ) that cannot be chosen as the n-th column 
of a parity-check matrix for a (n, k, d) code. The i-th term 
in the RHS is the number of linear combinations of i of the 
already-chosen n — 1 columns. 

When the (n, k, d) code has a (n, k', d') trace, the form of 
the parity-check matrix results in a different upper bound on 
the tuples to be avoided in the n-th column. In this case, the 
parity-check matrix has the form H = , where H' is a 
n — k' x n binary matrix and H" is a k — k x n matrix over 
GF(2™ 1 ). Let us suppose that n — 1 columns of H have been 
constructed and we attempt to add the n-th column. For H' , 
the constraint to maintain a distance dl is the following: 



-yn — k 



d'-2 

^ E 

i=0 



n — 1 

i 



(6) 



Suppose the n-th column of H' , denoted h' n , has been chosen 
satisfying the above constraint. The number of (k' — fc)-tuples 
over GF(2 m ) to be avoided in the n-th column of H" can 
be bounded as follows. Consider a set / C {1,2,- •• ,n}, 
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Let Hi 



H" 



be the submatrix of H with columns 
indexed by I. For 1 < i < d' — 2, no linear combination of the 
columns of Hj can result in h' n by (O. For d! — 1 < i < d— 2, 
since the column rank of Hj is at least dl — 1, a maximum of 
(2 m — l)'~( d _1 ) linear combinations can result in h' n . Hence, 
an n-th column can be added for H", whenever 



(2 m ) k '- k > 



d-2 

E 



n - 1 



i=d'-l 

Combining © and (O, we get 



(2 m - iy 



-{d'-X) 



(7) 



d'-2 



,-k+(d'-l) log 2m (2™ - 1) > log 2 



i=0 



n- 1 



d-2 



i og2m ( J2 r^ 1 ) 1 



\.i=d'-l 



An asymptotic version (large m, n) of the above bound, with 
R = k/n, is the following: 



1 - R + - > ff a » - + ^2 - 
n V n / V n 



(9) 



D. Illustration of bounds 

The GHW bound is difficult to compute in the general case, 
since strong bounds for generalized Hamming weights do not 
exist when the dimension grows with blocklength. In fact, the 
Singleton bound is seen to be tight in this case ifTTl . 

For our purposes in this work, we compute the bounds 
discussed in this section for the case when (1) n = 255, 
df = 3, k' = 247 and (2) n = 255, d! = 4, k' = 246 
over GF(256). For this case, the corresponding trace codes 
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are the (1) (255,247,3) binary Hamming code and the (2) 
(255, 246, 4) even-weight subcode of the Hamming code. For 
the Hamming code, generalized Hamming weights have been 
found exactly in EH). Let C be the (n = 2 m -l, 2 m -m-l,3) 
Hamming code. The generalized Hamming weights of C are 
given by the following ordered set: 

K(C) : 1 < r < 2 rn -to- 1} = 

{1,2,-- • ,ti} \ {2 l : < i < to}. (10) 

If C is the even-weight subcode of C, the dual of C is the 
punctured Reed-Muller code. Hence, 

K(C") : 1 < r < 2™ - to - 2} = 

{2,--- ,n}\{l + 2 i : < i < m}. 

The bounds for the above two cases are shown in Fig. 
|2] These bounds hold for any code over GF(256) with 
n = 255 and a binary trace of minimum distance d! = 3 
and dl = 4. The marks 'x' represent the standard Singleton 
bound d < n — k + 1 without considering trace. The circle 
marks with legend 'SRS' represent points that are achieved by 
certain Subcodes of Reed-Solomon (SRS) codes that will be 
constructed in the later sections of this article. We see that the 
generalized Hamming weight bound is close to the standard 
Singleton bound. Hence, codes that achieve the generalized 
Hamming weight bound could be called 'trace-MDS'. The 
existential lower bound and the sphere-packing upper bound 
are shown as dotted lines in the figure. 

From Fig. |2 we see that the trace code does not significantly 
affect the rate when the minimum distance of the code (d) is 
reasonably larger than the minimum distance of the trace (d'). 
As can be expected, the upper and lower bounds are not very 
tight. This is because of the loose bounds on the combinatorial 
quantities in the derivation of the bound. These bounds could 
be improved in future work. 

The points corresponding to SRS codes shown in Fig. |2 are 
seen to correspond to optimal codes over GF(256) with a trace 
code of minimum distance 3 and 4. 

V. Sub Reed-Solomon Codes 

In this section, we discuss the construction and basic 
properties of sub Reed-Solomon (SRS) codes with a nontrivial 
trace. We restrict ourselves to images of GF(2 m ) over GF(2) 
for simplicity. The construction easily extends to the general 
case. We will see that the SRS codes are trace-MDS in some 
cases, where the GHW bound can be evaluated. 

A. Construction 

Let a be a primitive element of GF(2 m ). Let C z (t) denote 
the (n, n — 2t, 2t + 1) i-error correcting Reed-Solomon (RS) 
code of length n = 2 m — 1 with zero set be Z rs = {z, z + 
1 , • • • , z + 2t — 1 } mod n. The generator polynomial of C z (t ) 
is n?lo 1 ( :z; + aZ+l )- Typically, we let z = or z = 1. 

A SRS code C zz ,{t,t') (for t' < t) is a subcode of C z (t) 
with zero set Z rs U Zb c h, where Zbch is the zero set of a 
t' -error-correcting binary BCH code i.e. 

Zbch = C z i U C z i + \ U ■ ■ • U Cj' + 2t'-i, 




(a) d' = 3 
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(b) d' = 4 

Fig. 2. Illustration of bounds for n = 255, m = 8. 

where Ci denotes the cyclotomic coset of i modulo n under 
multiplication by 2 and z' e Z rs . In the simplest examples, 
we choose z = z' = 1 and denote Cn(t, t') as simply C(t, t'). 
In some cases, we pick z = and z' = 1. 

Example 1: Let a be a primitive element of GF(256). 

1) C(8, 1) is the subcode of the 8 -error-correcting 
(255, 239, 17) RS code (C(8)) with zeros 
{1,2,- ■• ,16,32,64, 128}. C(8,l) is a (255, 236, 
> 17) code. 

2) C(8,2) is the subcode of the 8-error- 
correcting (255, 239, 17) RS code with zeros 
{1, 2, • ■ ■ , 16, 24, 32, 48, 64, 96, 128, 129, 192}. C(8, 2) 
is a (255, 231, > 17) code. 

3) C(6, 1) is the subcode of the 6-error- 
correcting (255, 243, 13) RS code with zeros 
{1,2,- •• ,12,16,32,64,128}. C(6, 1) is a (255, 
239, > 13) code. 

4) Coi(6, 1) is the subcode of the 6-error- 
correcting (255,243,13) RS code with zeros 
{0, 1, 2, ■•■ ,11,16,32,64,128}. C i(6,l) is a 
(255,239, > 13) code. 
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B. Properties 

The following properties can be proved for the SRS code 
C zz ,{t,t') of length n = 2 m - 1 over GF(2 m ). 

Proposition 6: The trace of C zz >(t,t') is the binary cyclic 
code with zero set Zb c h U Z' rs , where Z' rs C Z rs is the largest 
possible union of cyclotomic cosets contained in Z rs . 

Proof: This follows from Delsarte's theorem (Q~|i and J9] 
Chap 7 (Problem 33)] ■ 
Thus, by Proposition Q] we see that when a codeword of the 
binary image of C zz / (t, t') is written down as a n x m matrix, 
each column will belong to the t' -error-correcting binary BCH 
code. When z = 1, the trace will be equal to the BCH code 
in most practically relevant cases. However, when z = 0, the 
trace will be the even-weight subcode of the t' -error-correcting 
BCH code. 

We now state a simple result about the subfield subcode of 
C zz '{t,t'). This result is useful in finding the exact minimum 
distance of SRS codes. 

Proposition 7: The subfield subcode of the SRS code 
C zz i(t,t') is the binary cyclic code of length n with zero set 
I J c7 ii7 C s . If z = z' — 1, the subfield subcode is the 
i-error correcting BCH code with zeros U s€Z C s . 

Proof: This follows from (9] Chap 7 (Problem 33)]. ■ 
As an example, consider the (255, 239, > 13) code C(6, 1) 
over GF(256). The trace of the code is the length-255 binary 
Hamming code. The subfield subcode is the 6-error-correcting 
length-255 binary BCH code with exact minimum distance 13 
fP2l . Hence, C(6, 1) is a (255, 239, 13) code over GF(256). 
The (255,239,> 13) code C i(6,l) over GF(256) has trace 
equal to the even-weight subcode of the length-255 binary 
Hamming code. Note that the minimum distance of the trace 
of C i(6, 1) is 4. 

Table J] summarizes the parameters for some SRS codes that 
could have possible applications in practice. 

The first four codes in Table Q] (with tf = 1) meet the 
generalized Hamming weight bound and are MDS under 
the trace constraint. In general, for n = 255, m = 8, 
d! = 3 and d = 2t + 1, the zero set works out to be 
Z = {1,2,4,8,16,32,64, 128} U {1,2,3, ••■ ,2t}. Hence, 
\Z\ = 8+2t-([log 2 (2t)J+l andd = n-k+ [_log 2 (d-l)J -6. 
Similarly, for d! — 4, we get d = n — k + [log 2 (c? — 2) J — 6. 
Therefore, the SRS codes have minimum distances close to the 
Singleton bound, particularly as d increases. For both dt = 3 
and d! = 4, these codes can be easily shown to meet the GHW 
bound. 

When the additional trace structure of SRS codes is used 
in the decoding, SRS codes turn out to be good competitors 
to RS codes offering good trade-offs between coding gain and 
complexity. 

VI. List Decoders and Error-Correcting 
Properties 

Since the minimum distance of the SRS code C(t,t') of 
length n = 2 m - 1 symbols over GF(2 m ) is 2t + 1 in 
most cases, algebraic bounded distance decoding does not 
appear to be promising. Also, algebraically the trace operator 
is difficult to handle in a Berlekemp-Massey-like decoder 



based on simplifying power sums by Newton's identities. 
However, utilizing the structure of the image in a list decoder 
is beneficial as described below. Using the intuition gained 
from list decoders, we propose several soft decoders in later 
sections. 

Though an SRS code has a lesser minimum distance than 
an equal-rate RS code in many cases of interest, simple list 
decoders can be designed to correct a significant fraction of 
errors above half the minimum distance. In this section, we 
introduce and study list decoders for SRS codes, primarily as 
a means for studying the error-correcting capability of SRS 
codes. 

A. List decoders 

Consider the SRS code C(t,t') over GF(2 Tn ). As seen 
before, every codeword of the binary image of C(t, t') can be 
written down as a n x to matrix with each column belonging 
to the t' -error-correcting binary BCH code. 

The proposed list decoder works as follows. The input to 
the decoder is the n x m matrix R of received bits. Let 
denote the 7-th column of R. The first block of the decoder is 
a bounded-distance decoder for the i'-error correcting binary 
BCH code of length n. The BCH decoder runs on each column 
Ri, 1 < i < m. The output of the i-th BCH decoder is denoted 
R^. In case of decoder failure, R t = Let R denote the 
?? x ?77 matrix whose i-th column is R { . The next step in 
the decoding is performed by a bank of L t-error-correcting 
bounded-distance RS decoders. The i-th decoder (1 < i < L) 
is parametrized by a set Si, which is a subset of {1, 2, ■ • ■ , to}. 
The input to the 7-th RS decoder is a n x to matrix whose 
j-th column is Rj if j € Si or Rj if j ^ Si (1 < j < to). 
The matrix is converted to a n x 1 vector over GF(2 m ) for 
decoding by the 7-th RS decoder. 

Note that the set Si specifies the columns that are decoded 
by the t' -error-correcting binary BCH decoder before input to 
the i th RS decoder. Different RS decoders in the second step 
are parametrized by different Si. The output from the L RS 
decoders forms the list of possible codewords. The maximum 
list size is seen to be 2 m . 

B. Analysis of the list decoder 

We devise a counting algorithm to calculate the fraction of 
weight-w errors correctable by C(t, t') using the proposed list 
decoder with list size set as 2 m . For w < t, the fraction is 1. 
The calculation is done for w > t. 

Let P m (w) denote the set of partitions of w into not more 
than 777 parts. Let p be the partition given by w = W\ + 
W2 + ■ ■ ■ + wi where wi > W2 > ■ ■ ■ > wi. The numbers 
wi,u>2, ■ ■ ■ ,wi denote the number of bit errors affecting I out 
of the 777 columns of the n x to codeword matrix. Equivalently, 
we can think of W\ , W2, ■ ■ ■ , wi as the weights of I out of the 
to columns of the n x to binary error matrix E. 

For a given partition p = Wi + W2 + • • • + wi of w, an 
ensemble of error patterns £(p) exists with the column weight 
distribution {wi, W2, ■ ■ ■ , wi}. The size of the set £(p) is seen 
to be 
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C zz >{t,t') 


(n, k) 


n - k + 1 


d 


d' 




Coi(6, 1) 


(255,239) 


17 


13 


4 




{1, 2, 4, 8, 16, 32, 64, 128} U {0, 1, 2, ■ ■ ■ , 11} 




Coi(8,l) 


(255,235) 


21 


17 


4 




{1, 2, 4, 8, 16, 32, 64, 128} U {0, 1, 2, ■ ■ ■ , 15} 




Cn(16,l) 


(255,221) 


35 


33 


3 




{1, 2, 4, 8, 16, 32, 64, 128} U {1, 2, 3, ■ ■ ■ , 32} 




Coi(17,l) 


(255,219) 


37 


35 


4 




{1, 2, 4, 8, 16, 32, 64, 128} U {0, 1, 2, ■ ■ ■ , 33} 




Cn(8,2) 


(255,231) 


25 


17 


5 


{1,2, 


■ ■ ■ ,64, 128, 3, 6, 12, ■ ■ ■ , 192, 129} U {1, 2, • • ■ 


,16} 


Cn(16,2) 


(255,217) 


39 


33 


5 


{1,2, 


■ ■ ■ ,64, 128, 3, 6, 12, ■ ■ • , 192, 129} U {1, 2, • • ■ 


,32} 



TABLE I 

Parameters of SRS codes with n = 255 over GF(256) 



where r is the number of distinct weights in the set of weights 
{wi, W2, ■ ■ ■ , wi}, and n; is the number of times the i-th 
distinct weight occurs in the set of weights. For instance, if the 
set of weights is {4, 3, 3, 1, 1}, then r = 3, rix = 1, n% = 2, 
and 71,3 = 2. 

Thus, the fraction of correctable errors for weight w, 
denoted f w is given by 

_ E p Pc(p)\£(p)\ 

where P c (p) is the probability that an error vector with column 
weight distribution p is correctable. 

To determine P c (p), the partitions in P m (w) are modified by 
deleting the parts that are lesser than t' to account for the BCH 
decoder. Since the list size is 2™, there exists an RS decoder 
parametrized by the set of columns corresponding to the parts 
in p of weight less than t'. For example, let tf = 1 and w = 9. 
Let p be the partition given by9 = 4 + 3 + l + l;pis modified 
as p given by p = 4 + 3. Hence, a suitable RS decoder will 
see an error matrix with column weight distribution p. Each 
partition in P m (w) is modified in a similar way to form a set 
P m (w). Let p be given by p = w\ + u>2 + • ■ • + Wk- The sum 
w = wi + u>2 + ■ ■ ■ + Wh need not be equal to w; it is less 
than or equal to w. Based on the modified partition p, we have 
four different cases. 

1) If p is empty, it implies that all elements in the partition p 
were < t' . A suitable RS decoder will output the correct 
codeword, and P c {p) = 1. 

2) If w < t, then whatever way errors are distributed along 
different columns, the total number of rows affected 
cannot exceed t. A suitable RS decoder will output the 
correct codeword, and P c {p) = 1. 

3) If w± > t > t', then more than t rows will be in error for 
all RS decoders. By the bounded-distance property, we 
assume that such error patterns can never be corrected, 
and P c (p) = 0. 

4) If p does not fall into any of the above three categories, 
the error pattern may or may not be correctable depend- 
ing on how the errors are distributed along the columns. 
For this case, a more detailed analysis is necessary. In 
this case, < P c (p) < 1. 

For Case 4 above, the computation of P c (p) is done as 
follows. An error matrix E G £(p) for p = w\ + w-2 + ■ ■ ■ + 
Wk is modeled by a discrete random process that involves k 
steps. The i-th step corresponds to the random placement of 
Wj ones in one of the m columns. Let {Y\, Y2, . . . , Yfe} be a 
sequence of discrete random variables, where Yi denotes the 



total number of nonzero rows of E after the i-th step. For 
instance, Y% denotes the number of nonzero rows of E after 
the first step, which will be w\ with probability 1. Y2 denotes 
the number of nonzero rows after the second step. Y2 takes 
values from w\ to (wi +W2) with different probabilities. The 
probability mass function (PMF) of Y2 can be determined from 
the PMF of Y\ and the value u>2. Similarly, we can find the 
PMFs of all the random variables Y\ to Yk starting from the 
PMF of Y\ and the values wi, W2, ■ ■ ■ , Wfc. Finally, 

P c (p) = Prob{r fe < t}. 

Fig. [3] shows a comparison of the 8 -error-correcting (255, 
239, 17) RS code (C(8)) over GF(256) and the (255, 239, 13) 
SRS code (C(6, 1)) over GF(256). The list decoder was sim- 



Fraction of Error Correctability for SRS(255.239,13) code 




4 5 6 7 8 9 10 11 12 13 

Total weight of the error, w 



(a) DL: list decoder of size L 




(b) Block-error rate plot 
Fig. 3. Comparison of C(6, 1) and C(8) over GF(256). 
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ulated over an AWGN channel with hard-decision decoding 
to verify the analysis. We see that the analysis matches with 
the simulated list decoder in the block-error rate plot, and the 
SRS code is competitive with the RS code of same rate down 
to a block-error rate of 1CP 10 . 

Notice that the list decoder D256 (see Fig. |3ja)) corrects 
a significant fraction of weight-7, 8 and 9 errors though the 
minimum distance of the code is 13. It is interesting to note 
that Dl fails to correct some weight-6 errors because of 
errors in the Hamming decoders in the first step of decoding. 
An important factor in successful decoding is the choice of 
suitable columns of the received vector that need to be decoded 
by a Hamming decoder. We propose to use soft information 
from the channel for making suitable decisions in the first 
stage and develop practical decoders for SRS codes. 

VII. Soft-input Decoders 

Because of the special structure of SRS codes, several 
suboptimal soft decoders of varying complexity are possible. 
We propose three types of soft-input decoders of increasing 
complexity. The codes Cn(6, 1) and Coi(6,l) are compared 
with C(8) over GF(256) in our simulations. Soft decoders for 
other codes yield similar gains. 

We assume BPSK modulation (0 — * +1, 1 — > —1) over an 
AWGN channel with variance a 2 . The standard Q function, 
defined as Q(x) = -^== J x e~^~ dx, is used in describing the 
decoder. 

For a SRS code C(t, t') of length n = 2 m - 1 over GF(2 m ), 
the received information R is a n x m real-valued matrix and 
let Ri,j denotes the value in the i-th row and j-th column of R. 
The proposed soft-input decoders work in two stages. The first 
stage decodes the columns of R according to the trace code. 
We restrict ourselves to df = 3 (Hamming code) and d' = 4 
(even-weight subcode of Hamming code) for simplicity. The 
second stage decodes the output of the first stage according to 
the ^-error-correcting RS code over GF(2 m ). 

A. Soft-guided decoders 

We begin with a low-complexity soft-input decoder, which 
we call a soft-guided decoder. In the first stage of a soft-guided 
decoder for SRS codes, hard-decision syndromes for the trace 
code (Hamming or its even-weight subcode) are computed for 
each of the m columns of R. Depending on the trace code, 
the following possibilities occur: 

1) d! = 3: If the syndrome for the i-th column is non- 
zero and indicates an error in location e and \R e ,i\ < 
A, the location is confirmed to be in error; otherwise, 
the location is assumed to be error-free. The threshold, 
denoted A, is heuristically chosen to satisfy 

"V (1 - r . QMiQ(ill 

V p( p) i-o (i) ' 

which equates the (approximate) probabilities of single 
errors resulting in no confirmation to double errors 
resulting in erroneous confirmation. 



2) d' = 4: In this case, we can detect double errors. If the 
syndrome for the i-th column is non-zero and indicates 
a double error, no error locations are confirmed. If the 
syndrome indicates an error in location e and \R e .i\ < 
A, the location is confirmed to be in error. The threshold 
A is chosen to satisfy 



n— 3 



which equates the (approximate) probabilities of single 
errors resulting in no confirmation to triple errors result- 
ing in erroneous confirmation. 
Hard decisions are made on R, and the confirmed error 
locations are flipped. The output is a single n x m binary 
matrix. Note that several other similar suboptimal first stages 
with varying complexity can be designed. 

The second stage involves one i-error-correcting bounded- 
distance RS decoder working on the output of the first stage. 
The performance of the soft-guided decoder is shown in Fig. |4] 
We see that the performance of a simple soft-guided decoder 
for the SRS code is comparable to that of the hard-decision 
decoder for the MDS RS code at the same rate. Notice that 
the code Coi(6, 1) performs marginally better than Cn(6, 1) 
because of the identification of double errors. 



HDD RS(255.239 17> 
Soft-guided C(6,1) 
Soft-guided C (6,1) 




Fig. 4. Performance of soft-guided decoder. 



B. Hybrid decoders 

In hybrid soft-input decoders, the first stage is an optimal 
soft decoder for the trace code. An efficient implementation 
for bitwise-MAP decoders for Hamming codes and their even- 
weight subcodes can be found in fl3]fl"4"l . The complexity 
of these decoders is O(nlogn), where n is the blocklength. 
These decoders are implementable in hardware through trans- 
formations such as the Walsh-Hadamard transform. We skip 
the details of the implementation. 

In the first stage, an efficient MAP decoder is run on each 
column of R to obtain log-likelihood ratios (LLRs) for each 
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bit conditioned on the received values in the corresponding 
column (for a bit in the i-th column, the received values in R { 
are used). 

After the first stage, hard decisions are made on the LLRs 
to obtain a single n X m binary matrix. The second stage is 
a i-error-correcting bounded-distance RS decoder. We readily 
see that the complexity of the first stage in hybrid decoders is 
higher than that of soft-guided decoders. 

The performance of hybrid decoders is shown in Fig. [5] We 
see that the hybrid decoders provide a coding gain of more 
than 0.6 dB over hard-decision decoders of MDS RS codes at 
the same rate. We also notice that additional gain is obtained 
by using Coi(6, 1) with dl = 4. The gain is about 0.7 dB at a 
block error rate of 10 -3 . 




5 5.5 6 6.5 7 7.5 

Eb/No in dB 



Fig. 5. Performance of hybrid decoder. 

When compared to the bit-level GMD algorithm J4j, the 
hybrid decoder appears to be simpler in complexity. The 
soft processing in bit-level GMD involves sorting, which 
is comparable to the complexity of evaluating the Walsh- 
Hadamard transform. However, the hybrid decoder uses the 
traditional Berlekemp-Massey bounded-distance decoder only 
once, while the bit-level GMD employs the Koetter-Vardy 
(KV) soft-input decoder for RS codes iteratively. 

A weakness of the hybrid decoder is that bounds for very 
low block error rates are difficult to prove, unlike the bit-level 
GMD. The error-correcting capability of SRS codes under 
hard-decision list decoding, as depicted in Fig. [3] seems to 
suggest that the performance of hybrid decoders should extend 
to lower block error rates as well. 

C. Soft decoders 

We call the most complex among the proposed soft-input 
decoders as simply soft decoders. In the first stage, we 
employ efficient implementations of the optimal bitwise MAP- 
decoders for the trace (similar to hybrid decoders). In the 
second stage, the Koetter-Vardy (KV) soft-input decoder for 
RS codes presented in J5) lfl5l is employed. The LLRs obtained 
after the first stage are converted to suitable inputs to the KV 
decoder using the methods suggested in lfl5l . We skip the 
details of the implementation, since we closely follow the ideas 
in lfT31 in our simulations. 



The performance of soft decoders is depicted in Fig. [6] 
We see that gains of about 0.8-0.9 dB over comparable hard- 




Eb/No in dB 

Fig. 6. Performance of soft decoder. 

decoded RS codes are possible with soft decoders. Gains of 
about 0.4-0.5 dB are obtained over KV soft decoding of RS 
codes of same rate. The parameter 'mmax' (from [3)) indicates 
the complexity of the second stage. 

The complexity of the proposed soft decoder is roughly 
comparable to that of the bit-level GMD decoder, provided 
the iterations of the KV soft decoder (in bit-level GMD) are 
carefully optimized. The gain of the proposed soft decoder is 
marginally better than bit-level GMD. 

In summary, for the code Coi(6,l), we observe that soft- 
guided decoders appear to be similar in performance to MDS 
RS codes at the same rate. We see gains of about 0.7 dB over 
hard-decision RS decoders with limited complexity hybrid 
decoders. For more complex soft decoders, we observe gains 
of 0.4-0.5 dB over soft KV decoders. 



VIII. Conclusion 

In this work, we proposed and studied a new approach for 
obtaining higher coding gains in situations where traditional 
Reed-Solomon codes have been used so far - namely, rate 
about 0.9 over GF(256). The approach suggests the use of 
a suitably chosen subcode of Reed-Solomon codes. This 
subcode is characterized by the property that its trace code 
has a minimum distance larger than 1 . Using the properties of 
the trace and image, we showed that additional coding gain 
can be obtained by efficiently processing soft values. Gains of 
about 0.7-0.8 dB are possible over bounded-distance decoders 
of traditional RS codes with low complexity soft decoders such 
as the proposed hybrid decoder. When compared to other soft 
decoders for RS codes in the literature, a gain of 0.4-0.5 dB 
is possible with the proposed soft decoder for SRS codes. 

This work demonstrates the practical utility of studying the 
properties of trace and image of codes over non-binary fields. 
Several avenues are possible for extending this study both from 
a theoretical and practical viewpoint. 
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